
FIGURE 1 

cDNA sequence of wild type amFP486 

ATGGCTCTTTCAAACAAGTTTATCGGAGATGACATGAAAATGACCTACCATATGGATG 

GCTGTGTCAATGGGCATTACTTTACCGTCAAAGGTGAAGGCAACGGGAAGCCATACGA 

AGGGACGCAGACCTCGACTTTTAAAGTCACCATGGCCAACGGTGGGCCCCTTGCATTC 

TCCTTTGACATACTATCTACAGTGTTCAAGTATGGAAATCGATGCTTTACTGCGTATC 

CTACCAGTATGCCCGACTATTTCAAACAAGCATTTCCTGACGGAATGTCATATGAAAG 

GACTTTTACCTATGAAGATGGAGGAGTTGCTACAGCCAGTTGGGAAATAAGCCTTAAA 

GGCAACTGCTTTGAGCACAAATCCACGTTTCATGGAGTGAACTTTCCTGCTGATGGAC 

CTGTGATGGCGAAGATGACAACTGGTTGGGACCCATCTTTTGAGAAAATGACTGTCTG 

CGATGGAATATTGAAGGGTGATGTCACCGCGTTCCTCATGCTGCAAGGAGGTGGCAAT 

TACAGATGCCAATTCCACACTTCTTACAAGACAAAAAAACCGGTGACGATGCCACCAA 

ACCATGCGGTGGAACATCGCATTGCGAGGACCGACCTTGACAAAGGTGGCAACAGTGT 

TCAGCTGACGGAGCACGCTGTTGCACATATAACCTCTGTTGTCCCTTTC (SEQ ID NO: 01)

amino acid sequence of wild type amFP486 



MALSNKFIGD DMKMTYHMDG CVNGHYFTVK GEGNGKPYEG TQTSTFKVTM ANGGPLAFSF
DILSTVFKYG NRCFTAYPTS MPDYFKQAFP DGMSYERTFT YEDGGVATAS WEISLKGNCF
EHKSTFHGVN FPADGPVMAK MTTGWDPSFE KMTVCDGILK GDVTAFLMLQ GGGNYRCQFH
TSYKTKKPVT MPPNHAVEHR IARTDLDKGG NSVQLTEHAV AHITSVVPF
(SEQ ID NO: 02)



Figure 2

cDNA sequence of wild type CFP484

TATAGGANCATNNGGGNGATTGGGGTCCAAAGCATTGTAACCAACGCAGATAACCCCCAG 
TGGTNTCAAACGCAGANAACGCGGGAACATTGGAAAATTGANTNTTAAGGAGGCAAGGAA 
TCGGGAGTAAAGTTGCGAGAAACTGAAAAAATGAAGTGTAAATTTGTGTTCTGCCTGTCC 
TTCTTGGTCCTCGCCATCACAAACGCGAACATTTTTTTGAGAAACGAGGCTGACTTAGAA 
GAGAAGACATTGAGAATACCAAAAGCTCTAACCACCATGGGTGTGATTAAACCAGACATG 
AAGATTAAGCTGAAGATGGAAGGAAATGTAAACGGGCATGCTTTTGTGATCGAAGGAGAA 
GGAGAAGGAAAGCCTTACGATGGGACACACACTTTAAACCTGGAAGTGAAGGAAGGTGCG 
CCTCTGCCTTTTTCTTACGATATCTTGTCAAACGCGTTCCAGTACGGAAACAGAGCATTG 
ACAAAATACCCAGACGATATAGCAGACTATTTCAAGCAGTCGTTTCCCGAGGGATATTCC 
TGGGAAAGAACCATGACTTTTGAAGACAAAGGCATTGTCAAAGTGAAAAGTGACATAAGC 
ATGGAGGAAGACTCCTTTATCTATGAAATTCGTTTTGATGGGATGAACTTTCCTCCCAAT 
GGTCCGGTTATGCAGAAAAAAACTTTGAAGTGGGAACCATCCACTGAGATTATGTACGTG 
CGTGATGGAGTGCTGGTCGGAGATATTAGCCATTCTCTGTTGCTGGAGGGAGGTGGCCAT 
TACCGATGTGACTTCAAAAGTATTTACAAAGCAAAAAAAGTTGTCAAATTGCCAGACTAT 
CACTTTGTGGACCATCGCATTGAGATCTTGAACCATGACAAGGATTACAACAAAGTAACG 
CTGTATGAGAATGCAGTTGCTCGCTATTCTTTGCTGCCAAGTCAGGCCTAGACAACAAGG 
ATACTGAAAACATATTTGTCTGAGGGTTTGTGTTGTTTTTTAAAAGACATCAGCTCAGCA 
TTCGTTAGTTGTAACAAAAAATAGCTTTAATTTTTGGTGGGATTAAATCATAGGGATTTG 
TTTTAGTAATCATTTTGCTTAATAAAAAGTGCCTTG (SEQ ID NO: 03) 

amino acid sequence of wild type CFP484

Figure 3

cDNA sequence of zFP506 

ATGGCTCAGTCAAAGCACGGTCTAACAAAAGAAATGACAATGAAATACCGTATGGAAGGGTGC 
GTCGATGGACATAAATTTGTGATCACGGGAGAGGGCATTGGATATCCGTTCAAAGGGAAACAG 
GCTATTAATCTGTGTGTGGTCGAAGGTGGACCATTGCCATTTGCCGAAGACATATTGTCAGCT 
GCCTTTATGTACGGAAACAGGGTTTTCACTGAATATCCTCAAGACATAGCTGACTATTTCAAG 
AACTCGTGTCCTGCTGGTTATACATGGGACAGGTCTTTTCTCTTTGAGGATGGAGCAGTTTGC 
ATATGTAATGCAGATATAACAGTGAGTGTTGAAGAAAACTGCATGTATCATGAGTCCAAATTT 
TATGGAGTGAATTTTCCTGCTGATGGACCTGTGATGAAAAAGATGACAGATAACTGGGAGCCA 
TCCTGCGAGAAGATCATACCAGTACCTAAGCAGGGGATATTGAAAGGGGATGTCTCCATGTAC 
CTCCTTCTGAAGGATGGTGGGCGTTTACGGTGCCAATTCGACACAGTTTACAAAGCAAAGTCT 
GTGCCAAGAAAGATGCCGGACTGGCACTTCATCCAGCATAAGCTCACCCGTGAAGACCGCAGC 
GATGCTAAGAATCAGAAATGGCATCTGACAGAACATGCTATTGCATCCGGATCTGCATTGCCC 

(SEQ ID NO: 05) 

amino acid sequence of zFP506 

MAQSKHGLTK EMTMKYRMEG CVDGHKFVIT GEGIGYPFKG KQAINLCVVE GGPLPFAEDI LSAAFNYGNR VFTEYPQDIA
DYFKNSCPAG YTWDRSFLFE DGAVCICNAD ITVSVEENCM YHESKFYGVN FPADGPVMKK MTDNWEPSCE KIIPVPKQGI 
LKGDVSMYLL LKDGGRLRCQ FDTVYKAKSV PRKMPDWHFI QHKLTREDRS DAKNQKWHLT EHAIASGSAL P 
(SEQ ID NO: 06)

Figure 4

cDNA sequence of zFP538

gagttgagtt tctcgacttc agttgtatca attttggggc atcaagcgat ctattttcaa
catggctcat tcaaagcacg gtctaaaaga agaaatgaca atgaaatacc acatggaagg
gtgcgtcaac ggacataaat ttgtgatcac gggcgaaggc attggatatc cgttcaaagg
gaaacagact attaatctgt gtgtgatcga agggggacca ttgccatttt ccgaagacat
attgtcagct ggctttaagt acggagacag gattttcact gaatatcctc aagacatagt
agactatttc aagaactcgt gtcctgctgg atatacatgg ggcaggtctt ttctctttga
ggatggagca gtctgcatat gcaatgtaga tataacagtg agtgtcaaag aaaactgcat
ttatcataag agcatattta atggaatgaa ttttcctgct gatggacctg tgatgaaaaa
gatgacaact aactgggaag catcctgcga gaagatcatg ccagtaccta agcaggggat
actgaaaggg gatgtctcca tgtacctcct tctgaaggat ggtgggcgtt accggtgcca
gttcgacaca gtttacaaag caaagtctgt gccaagtaag atgccggagt ggcacttcat
ccagcataag ctcctccgtg aagaccgcag cgatgctaag aatcagaagt ggcagctgac
agagcatgct attgcattcc cttctgcctt ggcctgataa gaatgtagtt ccaacatttt
aatgcatgtg cttgtcaatt attctgataa aaatgtagtt gagttgaaaa cagacaagta
caaataaagc acatgtaaat cgtct
(SEQ ID NO: 07)



amino acid sequence of zFP538 



Met Ala [...] His [...] Met Glu Gly [...] Val [...] Ile Thr Gly Glu Gly
[...] Gly [...] Pro [...] Cys Val Ile Glu Gly [...] Pro Leu Pro Phe Ser
Glu Asp Ile Leu Ser Ala Gly Phe Lys Tyr Gly Asp Arg Ile Phe Thr
Glu Tyr Pro Gln Asp Ile Val Asp Tyr Phe Lys Asn Ser Cys Pro Ala
Gly Tyr Thr Trp Gly Ser Phe Leu Phe Glu Asp Gly Ala Val Cys Ile
Cys Asn Val Asp Ile Thr Val Ser Val Lys Glu Asn Cys Ile Tyr His
Lys Ser Ile Phe Asn Gly Met Asn Phe Pro Ala Asp Gly Pro Val Met
Lys Lys Met Thr Thr Asn Trp Glu Ala Ser Cys Glu Lys Ile Met Pro
Val Pro Lys Gln Gly Ile Leu Lys Gly Asp Val Ser Met Tyr Leu Leu
Leu Lys Asp Gly Gly Arg Tyr Arg Cys Gln Phe Asp Thr Val Tyr [...]
Ala Lys [...] Val Pro Ser Lys Met Pro Glu Trp His Phe Ile Gln His
Lys Leu Leu Arg Glu Asp Arg Ser Asp Ala Lys [...] Gln [...] Trp Gln
Leu Thr Glu His Ala Ile Ala Phe Pro Ser Ala Leu Ala
(SEQ ID NO: 08)

FIGURE 5

cDNA sequence of dsFP483

ACGGTCAGGGACACGGTGACCCACTTTGGTATTCTAACAAAATGAGTTGGTCCAAGAGTG 
TGATCAAGGAAGAAATGTTGATCGATCTTCATCTGGAAGGAACGTTCAATGGGCACTACT 
TTGAAATAAAAGGCAAAGGAAAAGGGAAGCCTAATGAAGGCACCAATACCGTCACGCTCG 
AGGTTACCAAGGGTGGACCTCTGCCATTTGGTTGGCATATTTTGTGCCCACAATTTCAGT 
ATGGAAACAAGGCATTTGTCCACCACCCTGACGACATACCTGATTATCTAAAGCTGTCAT 
TTCCGGAGGGATATACATGGGAACGGTCCATGCACTTTGAAGACGGTGGCTTGTGTTGTA 
TCACCAATGATATCAGTTTGACAGGCAACTGTTTCAACTACGACATCAAGTTCACTGGCT 
TGAACTTTCCTCCAAATGGACCCGTTGTGCAGAAGAAGACAACTGGCTGGGAACCGAGCA 
CTGAGCGTTTGTATCCTCGTGATGGCGTGTTGATAGGAGACATCCATCATGCTCTCACAG 
TGGAAGGAGGTGGTCATTACGTATGTGACATTAAAACTGTTTACAGGGCCAAGAAGCCCG 
TAAAGATGCCAGGGTATCACTATGTTGACACCAAACTGGTTATAAGGAGCAACGACAAAG 
AATTCATGAAAGTTGAGGAGCATGAAATCGCCGTTGCACGCCACCATCCGCTCCAAAGCC 
AATGAAGCTTAAGTAAAGCAAAAAGGTGACGAGGCATGATAGTATGACATGATAGTATGA 
CATGATAGTATGACATGATAGTAAGAATTGTAAGCAAAAGGCTTTGCTTATTAAACTTGT 
AATTGAAAAC (SEQ ID NO: 09) 



amino acid sequence of dsFP483

(SEQ ID NO: 10)

FIGURE 6

cDNA sequence of drFP583 

ATGAGGTCTTCCAAGAATGTTATCAAGGAGTTCATGAGGTTTAAGGTTCGCATGGAAGGAACGGTCAATGGGCACGAGT 
TTGAAATAGAAGGCGAAGGAGAGGGGAGGCCATACGAAGGCCACAATACCGTAAAGCTTAAGGTAACCAAGGGGGGACC 
TTTGCCATTTGCTTGGGATATTTTGTCACCACAATTTCAGTATGGAAGCAAGGTATATGTCAAGCACCCTGCCGACATA 
CCAGACTATAAAAAGCTGTCATTTCCTGAAGGATTTAAATGGGAAAGGGTCATGAACTTTGAAGACGGTGGCGTCGTTA 
CTGTAACCCAGGATTCCAGTTTGCAGGATGGCTGTTTCATCTACAAGGTCAAGTTCATTGGCGTGAACTTTCCTTCCGA 
TGGACCTGTTATGCAAAAGAAGACAATGGGCTGGGAAGCCAGCACTGAGCGTTTGTATCCTCGTGATGGCGTGTTGAAA 
GGAGAGATTCATAAGGCTCTGAAGCTGAAAGACGGTGGTCATTACCTAGTTGAATTCAAAAGTATTTACATGGCAAAGA 
AGCCTGTGCAGCTACCAGGGTACTACTATGTTGACTCCAAACTGGATATAACAAGCCACAACGAAGACTATACAATCGT 
TGAGCAGTATGAAAGAACCGAGGGACGCCACCATCTGTTCCTTTAA (SEQ ID NO: 11) 



cDNA sequence of drFP583.1 



GTCCTCCCAAGCAGTGGTATCAACGCAGAGTACGGGGGAGTTTCAGCCAGTGACGGT 
CAGTGACAGGGTGAGCCACTTGGTATACCAACAAAATGAGGTCTTCCAAGAATGTTA 
TCAAGGAGTTCATGAGGTTTAAGGTTCGCATGGAAGGAACGGTCAATGGGCACGAGT 
TTGAAATAGAAGGCGAAGGAGAGGGGAGGCCATACGAAGGCCACAATACCGTAAAGC 
TTAAGGTAACCAAGGGGGGACCTTTGCCATTTGCTTGGGATATTTTGTCACCACAAT
TTCAGTATGGAAGCAAGGTATATGTCAAGCACCCTGCCGACATACCAGACTATAAAA
AGCTGTCATTTCCTGAAGGATTTAAATGGGAAAGGGTCATGAACTTTGAAGACGGTG
GCGTCGTTACTGTAACCCAGGATTCCAGTTTGCAGGATGGCTGTTTCATCTACAAGT
CAAGTTCATTGGCGTTGAACTTTCCTTCCGATGGACCTGTTATGCAAAAGAAGACAA
TGGGCTGGGAAGCCAGCACTGAGCGTTTGTATCCTCGTGATGGCGTGTTGAAAGGAG
AGATTCATAAGGCTCTGAAGCTGAAAGACGGTGGTCATTACCTAGTTGAATTCAAAA
GTATTTACATGGCAAAGAAGCCTGTGCAGCTACCAGGGTACTACTATGTTGACTCCA
AACTGGATATAACAAGCCACAACGAAGACTATACAATCGTTGAGCAGTATGAAAGAA
CCGAGGGACGCCACCATCTGTTCCTTTAAGGCTGAACTTGGCTCAGACGTGGGTGAG
CGGTAATGACCACAAAAGGCAGCGAAGAAAAACCATGATCGTTTTTTTTAGGTTGGC
AGCCTGAAATCGTAGGAAATACATCAGAAATGTTACAAACAGG (SEQ ID NO: 45)

amino acid sequence of drFP583

MRSSKNVIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGHNTVKLKVTKGGPLPFAWDILSPQFQ
YGSKVYVKHPADIPDYKKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGCFIYKVKFIGVNFPSD
GPVMQKKTMGWEASTERLYPRDGVLKGEIHKALKLKDGGHYLVEFKSIYMAKKPVQLPGYYYVDSK
LDITSHNEDYTIVEQYERTEGRHHLFL (SEQ ID NO: 12)



amino acid sequence of drFP583.1



Met Arg Ser Ser Lys Asn Val Ile Lys Glu Phe Met Arg Phe Lys Val
Arg Met Glu Gly Thr Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu
Gly Glu Gly Arg Pro Tyr Glu Gly His Asn Thr Val Lys Leu Lys Val
Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln
Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro
Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val
Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser
Leu Gln Asp Gly Cys Phe Ile Tyr Lys Ser Ser Ser Leu Ala Leu Asn
Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu
Ala Ser Thr Glu Arg Leu Gly His Tyr Leu Val Glu Phe Lys Ser Ile
Ile Met Ala Lys Lys Pro Val Gln Leu Pro Gly Tyr Tyr Tyr Val Asp
Ser Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu
Gln Tyr Glu Arg Ser Glu Gly Arg His His Leu Phe Leu
(SEQ ID NO: 46)

FIGURE 7

Amino Acid and Nucleotide Sequence for asFP600 

ATGGCTTCCTTTTTAAAGAAGACTATGCCCTTTAAGACGACCATTGAAGGGACGGTTAATGGCCAC 
TACTTCAAGTGTACAGGAAAAGGAGAGGGCAACCCATTTGAGGGTACGCAGGAAATGAAGATAGAG 
GTCATCGAAGGAGGTCCATTGCCATTTGCCTTCCACATTTTGTCAACGAGTTGTATGTACGGTAGT 
AAGGCCTTCATCAAGTATGTGTCAGGAATTCCTGACTACTTCAAGCAGTCTTTCCCTGAAGGTTTT 
ACTTGGGAAAGAACCACAACCTACGAGGATGGAGGCTTTCTTACAGCTCATCAGGACACAAGCCTA 
GATGGAGATTGCCTCGTTTACAAGGTCAAGATTCTTGGTAATAATTTTCCTGCTGATGGCCCCGTG 
ATGCAGAACAAAGCAGGAAGATGGGAGCCATCCACCGAGATAGTTTATGAAGTTGACGGTGTCCTG 
CGTGGACAGTCTTTGATGGCCCTTAAGTGCCCTGGTGGTCGTCATCTGACTTGCCATCTCCATACT 
ACTTACAGGTCCAAAAAACCAGCTGCTGCCTTGAAGATGCCAGGATTTCATTTTGAAGATCATCGC 
ATCGAGATAATGGAGGAAGTTGAGAAAGGCAAGTGCTATAAACAGTACGAAGCAGCAGTGGGCAGG 
TACTGTGATGCTGCTCCATCCAAGCTTGGACATAAC (SEQ ID NO: 13) 

Amino acid 

MASFLKKTMP FKTTIEGTVN GHYFKCTGKG EGNPFEGTQE MKIEVIEGGP LPFAFHILST
SCMYGSKTFI KYVSGIPDYF KQSFPEGFTW ERTTTYEDGG FLTAHQDTSL DGDCLVYKVK
ILGNNFPADG PVMQNKAGRW EPATEIVYEV DGVLRGQSLM ALKCPGGRHL TCHLHTTYRS
KKPAAALKMP GFHFEDHRIE IMEEVEKGKC YKQYEAAVGR YCDAAPSKLG HN (SEQ ID NO: 14)



Figure 8

cDNA sequence of dgFP512

attcacctcg gtgatttgta agagaaagga tcaccatcaa gagaagagct gtaaaagtta   60
atattttact gtacttctac cagcatgagt gcacttaaag aagaaatgaa aatcaacctt  120
acaatggaag gtgttgttaa cgggcttcca tttaagatcc gtggggatgg aaaaggcaaa  180
ccataccagg gatcacagga gttaaccttg acggtggtta aaggcgggcc tctgcctttc  240
tcttatgata ttctgacaac gatgtttcag tacggcaaca gggcattcgt aaactaccca  300
gaggacatac cagatatttt caagcagacc tgttctggtc ctaatggtgg atattcctgg  360
caaaggacca tgacttatga agacggaggc gtttgcactg ctacaagcaa catcagcgtg  420
gttggcgaca ctttcaatta tgacattcac tttatgggag cgaattttcc tcttgatggt  480
ccagtgatgc agaaaagaac aatgaaatgg gaaccatcca ctgagataat gtttgaacgt  540
gatggaatgc tgaggggtga cattgccatg tctctgttgc tgaagggagg gggccattac  600
cgatgtgatt ttgaaactat ttataaaccc aataaggttg tcaagatgcc agattaccat  660
tttgtggacc actgcattga gataacgagt caacaggatt attacaacgt ggttgagctg  720
accgaggttg ctgaagcccg ctactcttcg ctggagaaaa tcggcaaatc aaaggcgtaa  780
atccaagcaa tctaagaaaa caacaaggca ttaaaccgaa tcaccgtttt gaatttttcg  840
ttcggaattt cttggtaaaa ctaggtttag aacgtttcat ttcgctggac ttctttgact  900
cagctgtaga caagaaaga
(SEQ ID NO: 15)

amino acid sequence of dgFP512



Met Ser Ala Leu Lys Glu Glu Met Lys Ile Asn Leu Thr Met Glu Gly
Val Val Asn Gly Leu Pro Phe Lys Ile Arg Gly Asp Gly Lys Gly Lys
Pro Tyr Gln Gly Ser Gln Glu Leu Thr Leu Thr Val Val Lys Gly Gly
Pro Leu Pro Phe Ser Tyr Asp Ile Leu Thr Thr Met Phe Gln Tyr Gly
Asn Arg Ala Phe Val Asn Tyr Pro Glu Asp Ile Pro Asp Ile Phe Lys
Gln Thr Cys Ser Gly Pro Asn Gly Gly Tyr Ser Trp Gln Arg Thr Met
Thr Tyr Glu Asp Gly Gly Val Cys Thr Ala Thr Ser Asn Ile Ser Val
Val Gly Asp Thr Phe Asn Tyr Asp Ile His Phe Met Gly Ala Asn Phe
Pro Leu Asp Gly Pro Val Met Gln Lys Arg Thr Met Lys Trp Glu Pro
Ser Thr Glu Ile Met Phe Glu Arg Asp Gly Met Leu Arg Gly Asp Ile
Ala Met Ser Leu Leu Leu Lys Gly Gly Gly His Tyr Arg Cys Asp Phe
Glu Thr Ile Tyr Lys Pro Asn Lys Val Val Lys Met Pro Asp Tyr His
Phe Val Asp His Cys Ile Glu Ile Thr Ser Gln Gln Asp Tyr Tyr Asn
Val Val Glu Leu Thr Glu Val Ala Glu Ala Arg Tyr Ser Ser Leu Glu
Lys Ile Gly Lys Ser Lys Ala
(SEQ ID NO: 16)

FIGURE 9

cDNA sequence of dmFP592 



agtttcagcc agtgacaggg tgagctgcca ggtattctaa caagatgagt tgttccaaga   60
atgtgatcaa ggagttcatg aggttcaagg ttcgtatgga aggaacggtc aatgggcacg  120
agtttgaaat aaaaggcgaa ggtgaaggga ggccttacga aggtcactgt tccgtaaagc  180
ttatggtaac caagggtgga cctttgccat ttgcttttga tattttgtca ccacaatttc  240
agtatggaag caaggtatat gtcaaacacc ctgccgacat accagactat aaaaagctgt  300
catttcctga gggatttaaa tgggaaaggg tcatgaactt tgaagacggt ggcgtggtta  360
ctgtatccca agattccagt ttgaaagacg gctgtttcat ctacgaggtc aagttcattg  420
gggtgaactt tccttctgat ggacctgtta tgcagaggag gacacggggc tgggaagcca  480
gctctgagcg tttgtatcct cgtgatgggg tgctgaaagg agacatccat atggctctga  540
ggctggaagg aggcggccat tacctcgttg aattcaaaag tatttacatg gtaaagaagc  600
cttcagtgca gttgccaggc tactattatg ttgactccaa actggatatg acgagccaca  660
acgaagatta cacagtcgtt gagcagtatg aaaaaaccca gggacgccac catccgttca  720
ttaagcctct gcagtgaact cggctcagtc atggattagc ggtaatggcc acaaaaggca  780
cgatgatcgt tttttaggaa tgcagccaaa aattgaaggt tatgacagta gaaatacaag  840
caacaggctt tgcttattaa acatgtaatt gaaaac                            876
(SEQ ID NO: 17)

amino acid sequence of dmFP592



Met Ser Cys Ser Lys Asn Val Ile Lys Glu Phe Met Arg Phe Lys Val
Arg Met Glu Gly Thr Val Asn Gly His Glu Phe Glu Ile Lys Gly Glu
Gly Glu Gly Arg Pro Tyr Glu Gly His Cys Ser Val Lys Leu Met Val
Thr Lys Gly Gly Pro Leu Pro Phe Ala Phe Asp Ile Leu Ser Pro Gln
Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro
Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val
Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Ser Gln Asp Ser Ser
Leu Lys Asp Gly Cys Phe Ile Tyr Glu Val Lys Phe Ile Gly Val Asn
Phe Pro Ser Asp Gly Pro Val Met Gln Arg Arg Thr Arg Gly Trp Glu
Ala Ser Ser Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Asp
Ile His Met Ala Leu Arg Leu Glu Gly Gly Gly His Tyr Leu Val Glu
Phe Lys Ser Ile Tyr Met Val Lys Lys Pro Ser Val Gln Leu Pro Gly
Tyr Tyr Tyr Val Asp Ser Lys Leu Asp Met Thr Ser His Asn Glu Asp
Tyr Thr Val Val Glu Gln Tyr Glu Lys Thr Gln Gly Arg His His Pro
Phe Ile Lys Pro Leu Gln
(SEQ ID NO: 18)

Figure 10







       M   A   L   S   N   E   F   I   G   D   D   M   K   M
 676  ATG GCC CTG TCC AAC GAG TTC ATC GGC GAC GAC ATG AAG ATG
      TAC CGG GAC AGG TTG CTC AAG TAG CCG CTG CTG TAC TTC TAC

       T   Y   H   M   D   G   C   V   N   G   H   Y   F   T   V
 721  ACC TAC CAC ATG GAC GGC TGC GTG AAC GGC CAC TAC TTC ACC GTG
      TGG ATG GTG TAC CTG CCG ACG CAC TTG CCG GTG ATG AAG TGG CAC

       K   G   E   G   S   G   K   P   Y   E   G   T   Q   T   S
 766  AAG GGC GAG GGC AGC GGC AAG CCC TAC GAG GGC ACC CAG ACC TCC
      TTC CCG CTC CCG TCG CCG TTC GGG ATG CTC CCG TGG GTC TGG AGG

       T   F   K   V   T   M   A   N   G   G   P   L   A   F   S
 811  ACC TTC AAG GTG ACC ATG GCC AAC GGC GGC CCC CTG GCC TTC TCC
      TGG AAG TTC CAC TGG TAC CGG TTG CCG CCG GGG GAC CGG AAG AGG

       F   D   I   L   S   T   V   F   M   Y   G   N   R   C   F
 856  TTC GAC ATC CTG TCC ACC GTG TTC ATG TAC GGC AAC CGC TGC TTC
      AAG CTG TAG GAC AGG TGG CAC AAG TAC ATG CCG TTG GCG ACG AAG

       T   A   Y   P   T   S   M   P   D   Y   F   K   Q   A   F
 901  ACC GCC TAC CCC ACC AGC ATG CCC GAC TAC TTC AAG CAG GCC TTC
      TGG CGG ATG GGG TGG TCG TAC GGG CTG ATG AAG TTC GTC CGG AAG

       P   D   G   M   S   Y   E   R   T   F   T   Y   E   D   G
 946  CCC GAC GGC ATG TCC TAC GAG AGA ACC TTC ACC TAC GAG GAC GGC
      GGG CTG CCG TAC AGG ATG CTC TCT TGG AAG TGG ATG CTC CTG CCG

       G   V   A   T   A   S   W   E   I   S   L   K   G   N   C
 991  GGC GTG GCC ACC GCC AGC TGG GAG ATC AGC CTG AAG GGC AAC TGC
      CCG CAC CGG TGG CGG TCG ACC CTC TAG TCG GAC TTC CCG TTG ACG

       F   E   H   K   S   T   F   H   G   V   N   F   P   A   D
1036  TTC GAG CAC AAG TCC ACC TTC CAC GGC GTG AAC TTC CCC GCC GAC
      AAG CTC GTG TTC AGG TGG AAG GTG CCG CAC TTG AAG GGG CGG CTG

       G   P   V   M   A   K   K   T   T   G   W   D   P   S   F
1081  GGC CCC GTG ATG GCC AAG AAG ACC ACC GGC TGG GAC CCC TCC TTC
      CCG GGG CAC TAC CGG TTC TTC TGG TGG CCG ACC CTG GGG AGG AAG

       E   K   M   T   V   C   D   G   I   L   K   G   D   V   T
1126  GAG AAG ATG ACC GTG TGC GAC GGC ATC TTG AAG GGC GAC GTG ACC
      CTC TTC TAC TGG CAC ACG CTG CCG TAG AAC TTC CCG CTG CAC TGG

       A   F   L   M   L   Q   G   G   G   N   Y   R   C   Q   F
1171  GCC TTC CTG ATG CTG CAG GGC GGC GGC AAC TAC AGA TGC CAG TTC
      CGG AAG GAC TAC GAC GTC CCG CCG CCG TTG ATG TCT ACG GTC AAG

       H   T   S   Y   K   T   K   K   P   V   T   M   P   P   N
1216  CAC ACC TCC TAC AAG ACC AAG AAG CCC GTG ACC ATG CCC CCC AAC
      GTG TGG AGG ATG TTC TGG TTC TTC GGG CAC TGG TAC GGG GGG TTG

       H   V   V   E   H   R   I   A   R   T   D   L   D   K   G
1261  CAC GTG GTG GAG CAC CGC ATC GCC AGA ACC GAC CTG GAC AAG GGC
      GTG CAC CAC CTC GTG GCG TAG CGG TCT TGG CTG GAC CTG TTC CCG

       G   N   S   V   Q   L   T   E   H   A   V   A   H   I   T
1306  GGC AAC AGC GTG CAG CTG ACC GAG CAC GCC GTG GCC CAC ATC ACC
      CCG TTG TCG CAC GTC GAC TGG CTC GTG CGG CAC CGG GTG TAG TGG

       S   V   V   P   F   *
1351  TCC GTG GTG CCC TTC TGA
      AGG CAC CAC GGG AAG ACT

(SEQ ID NO: 27 & 28)

Figure 11

Non-aggregating mutant FP3-NA was generated from zFP506-N65M (non-humanized version). Compared
with zFP506-N65M, FP3-NA contains two additional amino acid substitutions, K5E and K10E. One
accidental nucleotide substitution was also introduced by a PCR error (double underline).
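
The substitution notation used above (original residue, position, new residue) can be illustrated with a minimal Python sketch. The helper name `apply_substitutions` is hypothetical, and the sketch assumes that positions are 1-based and count the initiator Met as residue 1; under that assumption K5E and K10E turn the first residues of zFP506 (Figure 3) into the residues that follow the BamHI site in this figure.

```python
import re


def apply_substitutions(protein, substitutions):
    """Apply point substitutions written as <old><position><new>, e.g. 'K5E'.

    Assumption: positions are 1-based and the initiator Met is residue 1.
    """
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"unrecognized substitution: {sub!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= pos <= len(residues):
            raise ValueError(f"{sub}: position outside the sequence")
        if residues[pos - 1] != old:
            raise ValueError(f"{sub}: found {residues[pos - 1]} at position {pos}")
        residues[pos - 1] = new
    return "".join(residues)


# First 20 residues of zFP506 as listed in Figure 3.
ZFP506_N_TERMINUS = "MAQSKHGLTKEMTMKYRMEG"

print(apply_substitutions(ZFP506_N_TERMINUS, ["K5E", "K10E"]))
# prints MAQSEHGLTEEMTMKYRMEG
```

The cloned construct in this figure starts at Ala after the BamHI site, so its first translated residues read AQSEHGLTEE, which agrees with the output above once the leading Met is dropped.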

Cloning into pQE30 was done using BamHI and HindIII sites:

GGA TCC GCT CAG TCA GAG CAC GGT CTA ACA GAA GAA ATG ACA ATG AAA 
BamHI AQSEHGLTEEMTMK 

TAC CGT ATG GAA GGG TGC GTC GAT GGA CAT AAA TTT GTG ATC ACG GGA 
YRMEGCVDGHKFVITG

GAG GGC ATT GGA TAT CCG TTC AAA GGG AAA CAG GCT ATT AAT CTG TGT 
EGIGYPFKGKQAINLC 

GTG GTC GAA GGT GGA CCA TTG CCA TTT GCC GAA GAC ATA TTG TCA GCT 
VVEGGPLPFAEDILSA 

GCC TTT ATG TAC GGA AAC AGG GTT TTC ACT GAA TAT CCT CAA GAC ATA 
AFMYGNRVFTEYPQDI 

GTT GAC TAT TTC AAG AAC TCG TGT CCT GCT GGA TAT ACA TGG GAC AGG 
VDYFKNSCPAGYTWDR

TCT TTT CTC TTT GAG GAT GGA GCA GTT TGC ATA TGT AAT GCA GAT ATA 
SFLFEDGAVCICNADI 

ACA GTG AGT GTT GAA GAA AAC TGC ATG TAT CAT GAG TCC AAA TTC TAT 
TVSVEENCMYHESKFY 

GGA GTG AAT TTT CCT GCT GAT GGA CCT GTG ATG AAA AAG ATG ACA GAT 
GVNFPADGPVMKKMTD 

AAC TGG GAG CCA TCC TGC GAG AAG ATC ATA CCA GTA CCT AAG CAG GGG 
NWEPSCEKIIPVPKQG

ATA TTG AAA GGG GAT GTC TCC ATG TAC CTC CTT CTG AAG GAT GGT GGG 
ILKGDVSMYLLLKDGG 

CGT TTA CGG TGC CAA TTC GAC ACA GTT TAC AAA GCA AAG TCT GTG CCA 
RLRCQFDTVYKAKSVP 

AGA AAG ATG CCG GAC TGG CAC TTC ATC CAG CAT AAG CTC ACC CGT GAA 
RKMPDWHFIQHKLTRE

GAC CGC AGC GAT GCT AAG AAT CAG AAA TGG CAT CTG ACA GAA CAT GCT 
DRSDAKNQKWHLTEHA 



ATT GCA TCC GGA TCT GCA TTG CCC TGA AAG CTT
IASGSALP* HindIII (SEQ ID NO: 29 & 30)
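
The pairing of codon rows with one-letter amino acid rows in this figure can be re-derived with a short Python sketch. It assumes the standard genetic code; the function name `translate` is illustrative only and not part of the source.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between codons is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last codon rows of the insert, without the BamHI/HindIII sites.
first_row = "GCT CAG TCA GAG CAC GGT CTA ACA GAA GAA ATG ACA ATG AAA"
last_row = "ATT GCA TCC GGA TCT GCA TTG CCC TGA"

print(translate(first_row))  # prints AQSEHGLTEEMTMK
print(translate(last_row))   # prints IASGSALP*
```

Both outputs match the amino acid rows printed under the corresponding codon rows; the `*` marks the TGA stop codon.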

Figure 12

Amino acid sequence of zFP506 Yellow mutant

MAQSKHGLTKEMTMKYRMEGCVDGHKFVITGEGIGYPFKGKQAINLCVVEGGPLPFAEDILSAGFKYGDRVFTEYPQDI
VDYFKNSCPAGYTWDRSFLFEDGAVCICNADITVSVEENCMYHESKFYGVNFPADGPVMKKMTDNWEPSCEKIIPVPKQ 
GILKGDVSMYLLLKDGGRLRCQFDTVYKAKSVPRKMPDWHFIQHKLTREDRSDAKNQKWHLTEHAIASGSALP* 
(SEQ ID NO: 31) 



Figure 13 

Amino Acid Sequence of zFP506 Yellow/bright mutant

MAQSKHGLTKEMTMKYRMEGCVDGHKFVITGEGIGYPFKGKQAINLCVVEGGPLPFAEDILSAGFKYGDRVFTEYPQDI
VDYFKNSCPAGYTWNRSFLFEDGAVCICNADITVSVEENCVYHESKFYGVNFPADGPVMKKMTDNWEPSCEKIIPVPRQ 
GILKGDVSMYLLLKDGGRLRCQFDTVYKAKSVPRKMPDWHFIQHKLTREDRSDAKNQKWHLTEHAIASGSALS* 
(SEQ ID NO: 32) 




Figure 14 

Non-aggregating mutant FP4-NA was generated from zFP538-M128V (humanized version). Compared
with zFP538-M128V, FP4-NA contains two additional amino acid substitutions, K5E and K9T. Two
accidental nucleotide substitutions were also introduced by PCR errors (double underline).



Cloning into pQE30 was done using BamHI and HindIII sites:

GGA TCC GCC CAC AGC GAG CAC GGC CTG ACC GAG GAG ATG ACC ATG AAG 
BamHI AHSEHGLTEEMTMK 

TAC CAC ATG GAG GGC TGC GTG AAC GGC CAC AAG TTC GTG ATC ACC GGC 
YHMEGCVNGHKFVITG 

GAG GGC ATC GGC TAC CCC TTC AAG GGC AAG CAG ACC ATC AAC CTG TGC 
EGIGYPFKGKQTINLC 

GTG ATC GAG GGC GGC CCC CTG CCC TTC AGC GAG GAC ATC CTG AGC GCC 
VIEGGPLPFSEDILSA 

GGC TTC AAG TAC GGC GAC CGG ATC TTC ACC GAG TAC CCC CAG GAC ATC 
GFKYGDRIFTEYPQDI 

GTG GAC TAC TTC AAG AAC AGC TGC CCC GCC GGC TAC ACC TGG GGC CGG 
VDYFKNSCPAGYTWGR 

AGC TTC CTG TTC GAG GAC GGC GCC GTG TGC ATC TGT AAC GTG GAC ATC 
SFLFEDGAVCICNVDI 

ACC GTG AGC GTG AAG GAG AAC TGC ATC TAC CAC AAG AGC ATC TTC AAC 
TVSVKENCIYHKSIFN

GGC GTG AAC TTC CCC GCC GAC GGC CCC GTG ATG AAG AAG ATG ACC ACC 
GVNFPADGPVMKKMTT 

AAC TGG GAG GCC AGC TGC GAG AAG ATC ATG CCC GTG CCT AAG CAG GGC 
NWEASCEKIMPVPKQG 

ATC CTG AAG GGC GAC GTG AGC ATG TAC CTG CTG CTG AAG GAC GGC GGC 
ILKGDVSMYLLLKDGG 

CGG TAC CGG TGC CAG TTC GAC ACC GTG TAC AAG GCC AAG AGC GTG CCC 
RYRCQFDTVYKAKSVP 

AGC AAG ATG CCC GAG TGG CAC TTC ATC CAG CAC AAG CTG CTG CGG GAG 
SKMPEWHFIQHKLLRE 

GAC CGG AGC GAC GCC AAG AAC CAG AAG TGG CAG CTG ACC GAG CAC GCC 
DRSDAKNQKWQLTEHA 



ATC GCC TTC CCC AGC GCC CTG GCC TGA AAG CTT
IAFPSALA* HindIII (SEQ ID NOS: 33-34)

Figure 15

All mutants are derived from drFP583 (called "pink" or FP6) by random mutagenesis.

The mutants E57 and AG4 are derivatives of E5.

Mutant: E5 = V105A, S197T Phenotype: in E. coli, seen as Green overnight; matures to Red over 24 h at 37°C (final Red:Green peak ratio is 75:25); folding is faster than FP6.

Mutant: E8 = N42H Phenotype: always has two peaks, Green and Red, in approx. 60:40 ratio; folding is faster than E5 (about 8 h at 37°C).

Mutant: E83 = N42H, V71A, I180V Phenotype: always has two almost equal peaks, Green and Red; folding is the same as for E8.

Mutant: E5up = V105A Phenotype: seen as Red from the beginning; folding is faster than E5 (about 12-16 h). Almost no Green peak at the final point of maturation.

Mutant: E57 = V105A, I161T, S197A Phenotype: in general like E5up, but folding is faster (no more than 8-10 h). Very small Green peak at the final point of maturation (less than 5%).

Mutant: E5down = S197T Phenotype and folding rate are exactly the same as for E5.

Mutant: AG4 = V71M, V105A, S197T Phenotype: very bright Green, no Red at all (even at the beginning); folding is faster than E5 (no more than 16 h).

Mutant: AG45 = V71M, V105A, Y120H, S197T Phenotype: in general like AG4, but brighter (approx. twice).
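
As a consistency check on the mutant definitions above, the following Python sketch looks up each named position in the drFP583 amino acid sequence of Figure 6 (SEQ ID NO: 12) and reports any position whose wild-type residue differs from the one stated. The dictionary and the helper `wild_type_mismatches` are illustrative names, numbering is assumed to be 1-based from the initiator Met, and the label AG45 for the Y120H-containing mutant follows the codon annotations of this figure.

```python
import re

# drFP583 amino acid sequence as listed in Figure 6 (SEQ ID NO: 12).
DRFP583 = (
    "MRSSKNVIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGHNTVKLKVTKGGPLPFAWDILSPQFQ"
    "YGSKVYVKHPADIPDYKKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGCFIYKVKFIGVNFPSD"
    "GPVMQKKTMGWEASTERLYPRDGVLKGEIHKALKLKDGGHYLVEFKSIYMAKKPVQLPGYYYVDSK"
    "LDITSHNEDYTIVEQYERTEGRHHLFL"
)

# Mutant definitions copied from the list above.
MUTANTS = {
    "E5": "V105A, S197T",
    "E8": "N42H",
    "E83": "N42H, V71A, I180V",
    "E5up": "V105A",
    "E57": "V105A, I161T, S197A",
    "E5down": "S197T",
    "AG4": "V71M, V105A, S197T",
    "AG45": "V71M, V105A, Y120H, S197T",
}


def wild_type_mismatches(protein, spec):
    """Return substitutions whose stated original residue is not in the protein."""
    bad = []
    for old, pos, new in re.findall(r"([A-Z])(\d+)([A-Z])", spec):
        if protein[int(pos) - 1] != old:
            bad.append(f"{old}{pos}{new}")
    return bad


for name, spec in MUTANTS.items():
    print(name, wild_type_mismatches(DRFP583, spec) or "consistent")
```

Every mutant is reported as consistent, meaning each listed original residue (N42, V71, V105, Y120, I161, I180, S197) is present at that position in the sequence as printed.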



1 


Met 


Arg 


Ser 


Ser 


Lys 


Asn 


Val 


He 


Lys 


Glu 


Phe 


Met 


Arg 


Phe 


Lys 


Val 


16 


1 


ATG 


CGC 


TCC 


TCC 


AAG 


AAC 


GTC 


ATC 


AAG 


GAG 


TTC 


ATG 


CGC 


TTC 


AAG 


GTG 


48 


17 


Arg 


Met 


Glu 


Gly 


Thr 


Val 


Asn 


Gly 


His 


Glu 


Phe 


Glu 


He 


Glu 


Gly 


Glu 


32 


A Q 


CGC 


ATG 


GAG 


GGC 


ACC 


\j 1 la 


AAC 


GGC 


CAC 


GAG 


TTC 


GAG 


ATC 


GAG 


GGC 


GAG 


96 






















His (CAC) for E8 


and 


E83 






33 


Gly 


Glu 


Gly 


Arg 


Pro 


Tyr 


Glu 


Gly His 


Asn 


Thr 


Val 


Lys 


Leu 


Lys 


Val 


48 




GGC 


GAG 


GGC 


CGC 


CCC 


TAC 


GAG 


GGC 


CAC 


AAC 


ACC 


GTG 


AAG 


CTG 


AAG 


GTG 


14 4 


49 


Thr 


Lys 


Gly 


Gly 


Pro 


Leu 


Pro 


Phe 


Ala 


Trp 


Asp 


He 


Leu 


Ser 


Pro 


Gin 


64 


145 


ACC 


AAG 


GGC 


GGC 


CCC 


CTG 


CCC 


TTC 


GCC 


TGG 


GAC 


ATC 


CTG 


TCC 


CCC 


CAG 


192 
















Met (ATG) 


for AG4 and AG4 5 /Ala. (GCG) for E83 




65 


Phe 


Gin 


Tyr 


Gly 


Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro 80
193 TTC CAG TAC GGC TCC AAG GTG TAC GTG AAG CAC CCC GCC GAC ATC CCC 240

81 Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 96
241 GAC TAC AAG AAG CTG TCC TTC CCC GAG GGC TTC AAG TGG GAG CGC GTG 288

Ala (GCG) for E5, E57, AG4 and AG45
97 Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 112
289 ATG AAC TTC GAG GAC GGC GGC GTG GTG ACC GTG ACC CAG GAC TCC TCC 336

His (CAC) for AG45
113 Leu Gln Asp Gly Cys Phe Ile Tyr Lys Val Lys Phe Ile Gly Val Asn 128
337 CTG CAG GAC GGC TGC TTC ATC TAC AAG GTG AAG TTC ATC GGC GTG AAC 384

129 Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu 144
385 TTC CCC TCC GAC GGC CCC GTG ATG CAG AAG AAG ACC ATG GGC TGG GAG 432

145 Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Glu 160
433 GCC TCC ACC GAG CGC CTG TAC CCC CGC GAC GGC GTG CTG AAG GGC GAG 480

Thr (ACC) for E57
161 Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu Val Glu 176
481 ATC CAC AAG GCC CTG AAG CTG AAG GAC GGC GGC CAC TAC CTG GTG GAG 528

Val (GTC) for E83
177 Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val Gln Leu Pro Gly Tyr 192
529 TTC AAG TCC ATC TAC ATG GCC AAG AAG CCC GTG CAG CTG CCC GGC TAC 576

Thr (ACC) for E5, AG4 and AG45/Ala (GCC) for E57
193 Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr 208
577 TAC TAC GTG GAC TCC AAG CTG GAC ATC ACC TCC CAC AAC GAG GAC TAC 624

209 Thr Ile Val Glu Gln Tyr Glu Arg Thr Glu Gly Arg His His Leu Phe Leu *** 229
625 ACC ATC GTG GAG CAG TAC GAG CGC ACC GAG GGC CGC CAC CAC CTG TTC CTG TAA 678
(SEQ ID NO: 11 & 12)
FIGURE 16



Nucleic acid sequence of humanized drFP583 



ATGGTGCGCTCCTCCAAGAACGTCATCAAGGAGTTCATGCGCTTCAAGGTGCGCATGG 

AGGGCACCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCC 

TACGAGGGCCACAACACCGTGAAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTC 

GCCTGGGACATCCTGTCCCCCCAGTTCCAGTACGGCTCCAAGGTGTACGTGAAGCACC 

CCGCCGACATCCCCGACTACAAGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGC 

GCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAAGACTCCTCCCTGC 

AGGACGGCTGCTTCATCTACAAGGTGAAGTTCATCGGCGTGAACTTCCCCTCCGACGG 

CCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCACCGAGCGCCTGTACCC 

CCGCGACGGCGTGCTGAAGGGCGAGATCCACAAGGCCCTGAAGCTGAAGGACGGCG 

GCCACTACCTGGTGGAGTTCAAGTCCATCTACATGGCCAAGAAGCCCGTGCAGCTGCC 

CGGCTACTACTACGTGGACTCCAAGCTGGACATCACCTCCCACAACGAGGACTACAC 

CATCGTGGAGCAGTACGAGCGCACCGAGGGCCGCCACCACCTGTTCCTGTAG (SEQ ID NO: 35)

Figure 17 

DNA sequence (ORF) of E5-NA 

ATGGCCTCCTCCGAGAACGTCATCACCGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGAACGGCCACGAGT 
TCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCCACAACACCGTGAAGTTGAAGGTGACCAAGGGCGGCCC 
CCTGCCCTTCGCCTGGGACATCCTGTCCCCCCAGTTCCAGTACGGCTCCAAGGTGTACGTGAAGCACCCCGCCGACATC 
CCCGACTACAAGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGCGA 
CCGTGACCCAGGACTCCTCCCTGCAGGACGGCTGCTTCATCTACAAGGTGAAGTTCATCGGCGTGAACTTCCCCTCCGA 
CGGCCCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCACCGAGCGCCTGTACCCCCGCGACGGCGTGCTGAAG 
GGCGAGATCCACAAGGCCCTGAAGCTGAAGGACGGCGGCCACTACCTGGTGGAGTTCAAGTCCATCTACATGGCCAAGA 
AGCCCGTGCAGCTGCCCGGCTACTACTACGTGGACACCAAGCTGGACATCACCTCCCACAACGAGGACTACACCATCGT 
GGAGCAGTACGAGCGCACCGAGGGCCGCCACCACCTGTTCCTGTAA (SEQ ID NO: 36) 
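
The two open reading frames above can be compared codon by codon. The following is a minimal sketch, not part of the original figures: it uses the standard genetic code and two short in-frame fragments copied from Figure 16 (humanized drFP583) and Figure 17 (E5-NA). The function name `codon_differences` and the starting codon number 102 are assumptions made for illustration; the number is read off the numbered alignment that precedes Figure 16.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_differences(ref, var, first_codon=1):
    """Return (codon number, ref codon, ref aa, var codon, var aa) per mismatch."""
    ref = "".join(ref.split()).upper()
    var = "".join(var.split()).upper()
    if len(ref) != len(var) or len(ref) % 3:
        raise ValueError("sequences must be in frame and of equal length")
    diffs = []
    for i in range(0, len(ref), 3):
        r, v = ref[i:i + 3], var[i:i + 3]
        if r != v:
            diffs.append((first_codon + i // 3, r, CODON_TABLE[r], v, CODON_TABLE[v]))
    return diffs


# Fragments copied from Figure 16 and Figure 17 (hypothetical variable names).
# The starting codon number (102) is an assumption taken from the alignment.
DRFP583_FRAGMENT = "GGCGGCGTGGTGACCGTGACCCAAGAC"
E5_NA_FRAGMENT = "GGCGGCGTGGCGACCGTGACCCAGGAC"

for pos, r, r_aa, v, v_aa in codon_differences(DRFP583_FRAGMENT, E5_NA_FRAGMENT, 102):
    kind = "silent" if r_aa == v_aa else "missense"
    print(f"codon {pos}: {r} ({r_aa}) -> {v} ({v_aa}) [{kind}]")
```

Under these assumptions the fragment pair shows one amino acid change (Val to Ala, GTG to GCG) and one silent change (CAA to CAG, both Gln). The same function could be run over the full sequences once they are joined into single strings.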

Figure 18 

ATGGTGCGCT CCTCCAAGAA CGTCATCAAG GAGTTCATGC GCTTCAAGGT 
GCGCATGGAG GGCACCGTGA ACGGCCACGA GTTCGAGATC GAGGGCGAGG GCGAGGGCCG
CCCCTACGAG GGCCACAACA CCGTGAAGCT GAAGGTGACC AAGGGCGGCC CCCTGCCCTT 
CGCCTGGGAC ATCCTGTCCC CCCAGTTCCA GTACGGCTCC AAGGTGTACG TGAAGCACCC 
CGCCGACATC CCCGACTACA AGAAGCTGTC CTTCCCCGAG GGCTTCAAGT GGGAGCGCGT 
GATGAACTTC GAGGACGGCG GCGTGGCGAC CGTGACCCAA GACTCCTCCC TGCAGGACGG
CTGCTTCATC TACAAGGTGA AGTTCATCGG CGTGAACTTC CCCTCCGACG GCCCCGTAAT 
GCAGAAGAAG ACCATGGGCT GGGAGGCCTC CACCGAGCGC CTGTACCCCC GCGACGGCGT 
GCTGAAGGGC GAGACCCACA AGGCCCTGAA GCTGAAGGAC GGCGGCCACT ACCTGGTGGA 
GTTCAAGTCC ATCTACATGG CCAAGAAGCC CGTGCAGCTG CCCGGCTACT ACTACGTGGA 
CGCCAAGCTG GACATCACCT CCCACAACGA GGACTACACC ATCGTGGAGC AGTACGAGCG 
CACCGAGGGC CGCCACCACC TGTTCCTGTA G (SEQ ID NO: 37)
Figure 19.

Nucleic acid sequence FP6 (E57)-NA 



ATGGCCTCCTCCGAGAACGTCATCACCGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGA 

ACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCCACAACACCGTG 

AAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCCCAGTTCCAGT 

ACGGCTCCAAGGTGTACGTGAAGCACCCCGCCGACATCCCCGACTACAAGAAGCTGTCCTTCCCCGA 

GGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGCGACCGTGACCCAGGACTC 

CTCCCTGCAGGACGGCTGCTTCATCTACAAGGTGAAGTTCATCGGCGTGAACTTCCCCTCCGACGGC 

CCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCACCGAGCGCCTGTACCCCCGCGACGGC 

GTGCTGAAGGGCGAGACCCACAAGGCCCTGAAGCTGAAGGACGGCGGCCACTACCTGGTGGAGTTC 

AAGTCCATCTACATGGCCAAGAAGCCCGTGCAGCTGCCCGGCTACTACTACGTGGACGCCAAGCTGG 

ACATCACCTCCCACAACGAGGACTACACCATCGTGGAGCAGTACGAGCGCACCGAGGGCCGCCACCA 

CCTGTTCCTG (SEQ ID NO: 38)
Figure 20.

Non-aggregating mutant FP7-NA was generated from M35-5 (FP7a). In comparison with M35-5,
FP7-NA contains two additional substitutions: K6T and K7E. Nucleotide substitutions in the
codon for Leu-4 were introduced to optimize codon usage (double underline).

Cloning into pQE30 was done using BamHI and HindIII sites:

GGA TCC GCC TCC CTG CTG ACC GAG ACC ATG CCC TTC AGG ACC ACC ATC
BamHI ASLLTETMPFRTTI 

GAG GGC ACC GTG AAC GGC CAC TAC TTC AAG TGC ACC GGC AAG GGC GAG 
EGTVNGHYFKCTGKGE 

GGC AAC CCC CTC GAG GGC ACC CAG GAG ATG AAG ATC GAG GTG ATC GAG 
GNPLEGTQEMKIEVIE 

GGC GGC CCC CTG CCC TTC GCC TTC CAC ATC CTG TCC ACC TCC TGC ATG 
GGPLPFAFHILSTSCM 

TAC GGC TCC AAG GCC TTC ATC AAG TAC GTG TCC GGC ATC CCC GAC TAC 
YGSKAFIKYVSGIPDY 

TTC AAG CAG TCC CTC CCC GAG GGC TTC ACC TGG GAG CGC ACC ACC ACC 
FKQSLPEGFTWERTTT

TAC GAG GAC GGC GGC TTC CTG ACC GCC CAC CAG GAC ACC TCC CTG GAC
YEDGGFLTAHQDTSLD

GGC GAC TGC CTG GTG TAC AAG GTG AAG ATC CTG GGC AAC AAC TTC CCC
GDCLVYKVKILGNNFP

GCC GAC GGC CCC GTG ATG CAG AAC AAG GCC GGC CGC TGG GAG CCC TCC 
ADGPVMQNKAGRWEPS 

ACC GAG ATC GTG TAC GAG GTG GAC GGC GTG CTG CGC GGC CAG TCC CTG 
TEIVYEVDGVLRGQSL 

ATG GCC CTG GAG TGC CCC GGC GGT CGC CAC CTG ACC TGC CAC CTG CAC 
MALECPGGRHLTCHLH 

ACC ACC TAC CGC TCC AAG AAG CCC GCC TCC GCC CTG AAG ATG CCC GGC 
TTYRSKKPASALKMPG 

TTC CAC TTC GAG GAC CAC CGC ATC GAG ATC CTG GAG GAG GTG GAG AAG 
FHFEDHRIEILEEVEK 

GGC AAG TGC TAC AAG CAG TAC GAG GCC GCC GTG GGC CGC TAC TGC GAC 
GKCYKQYEAAVGRYCD 




GCC GCC CCC TCC AAG CTG GGC CAC AAC TG AAGCTT 
AAPSKLGHN* HindIII (SEQ ID NO: 39 & 40)
FIGURE 21
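
The codon rows and one-letter residue rows of Figure 20 can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the variable names are hypothetical. It translates one codon row from the figure and locates the HindIII recognition sequence (AAGCTT) in the final row, where the site begins on the last base of the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Second codon row of Figure 20, copied as printed.
ROW = "GAG GGC ACC GTG AAC GGC CAC TAC TTC AAG TGC ACC GGC AAG GGC GAG"
print(translate(ROW))

# Final row of Figure 20 with the spacing removed (stop codon plus HindIII site).
TAIL = "GCCGCCCCCTCCAAGCTGGGCCACAACTGAAGCTT"
HINDIII_SITE = "AAGCTT"
site_start = TAIL.find(HINDIII_SITE)

# The reading frame ends one base into the site, because the final A of the
# TGA stop codon is also the first A of AAGCTT.
orf = TAIL[:site_start + 1]
print(site_start, translate(orf))
```

The same check could be applied to each row of Figure 20 to confirm that the printed residues follow from the printed codons.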

ATG GCC TCC TTC CTG AAG AAG ACC ATG CCC TTC AAG ACC ACC ATC GAG
MASFLKKTMPFKTTIE

GGC ACC GTG AAC GGC CAC TAC TTC AAG TGC ACC GGC AAG GGC GAG GGC
GTVNGHYFKCTGKGEG

AAC CCC TTC GAG GGC ACC CAG GAG ATG AAG ATC GAG GTG ATC GAG GGC
NPFEGTQEMKIEVIEG

GGC CCC CTG CCC TTC GCC TTC CAC ATC CTG TCC ACC TCC TGC ATG TAC
GPLPFAFHILSTSCMY

GGC TCC AAG GCC TTC ATC AAG TAC GTG TCC GGC ATC CCC GAC TAC TTC
GSKAFIKYVSGIPDYF

AAG CAG TCC TTC CCC GAG GGC TTC ACC TGG GAG CGC ACC ACC ACC TAC
KQSFPEGFTWERTTTY

GAG GAC GGC GGC TTC CTG ACC GCC CAC CAG GAC ACC TCC CTG GAC GGC
EDGGFLTAHQDTSLDG

GAC TGC CTG GTG TAC AAG GTG AAG ATC CTG GGC AAC AAC TTC CCC GCC
DCLVYKVKILGNNFPA

GAC GGC CCC GTG ATG CAG AAC AAG GCC GGC CGC TGG GAG CCC TCC ACC
DGPVMQNKAGRWEPST

GAG ATC GTG TAC GAG GTG GAC GGC GTG CTG CGC GGC CAG TCC CTG ATG
EIVYEVDGVLRGQSLM

GCC CTG AAG TGC CCC GGC GGC CGC CAC CTG ACC TGC CAC CTG CAC ACC
ALKCPGGRHLTCHLHT

ACC TAC CGC TCC AAG AAG CCC GCC TCC GCC CTG AAG ATG CCC GGC TTC
TYRSKKPASALKMPGF

CAC TTC GAG GAC CAC CGC ATC GAG ATC ATG GAG GAG GTG GAG AAG GGC
HFEDHRIEIMEEVEKG

AAG TGC TAC AAG CAG TAC GAG GCC GCC GTG GGC CGC TAC TGC GAC GCC
KCYKQYEAAVGRYCDA

GCC CCC TCC AAG CTG GGC CAC AAC TGA
APSKLGHN* (SEQ ID NO: 41 & 42)
Figure 22

Sequence of humanized 6/9 hybrid gene and 6/9-Q3 mutant 

CAG(Q) for 6/9-2G and 6/9-Q3

1 ATG AGC TGC AGC AAG AAC GTG ATC AAG GAG TTC ATG CGG TTC AAG GTG 48
1 MSCSKNVIKEFMRFKV 16

49 CGG ATG GAG GGC ACC GTG AAC GGC CAC GAG TTC GAG ATC AAG GGC GAG 96
17 RMEGTVNGHEFEIKGE 32

97 GGC GAG GGC CGG CCC TAC GAG GGC CAC TGC AGC GTG AAG CTC ATG GTG 144
33 GEGRPYEGHCSVKLMV 48

145 ACC AAG GGC GGC CCC CTC CCC TTC GCC TTC GAC ATC CTC AGC CCC CAG 192
49 TKGGPLPFAFDILSPQ 64

193 TTC CAG TAC GGC AGC AAG GTG TAC GTG AAG CAC CCC GCC GAC ATC CCC 240
65 FQYGSKVYVKHPADIP 80

ATG(M) for 6/9-Q3
241 GAC TAC AAG AAG CTC AGC TTC CCC GAG GGC TTC AAG TGG GAG CGG GTG 288
81 DYKKLSFPEGFKWERV 96

289 ATG AAC TTC GAG GAC GGC GGC GTG GTG ACC GTG AGC CAG GAC AGC AGC 336
97 MNFEDGGVVTVSQDSS 112

337 CTC AAG GAC GGC TGC TTC ATC TAC GAG GTG AAG TTC ATC GGC GTG AAC 384
113 LKDGCFIYEVKFIGVN 128

385 TTC CCC AGC GAC GGC CCC GTG ATG CAG CGG CGG ACC CGG GGC TGG GAG 432
129 FPSDGPVMQRRTRGWE 144

433 GCC AGC AGC GAG CGG CTC TAC CCC CGG GAC GGC GTG CTC AAG GGC GAC 480
145 ASSERLYPRDGVLKGD 160

481 ATC CAC ATG GCC CTC CGG CTC GAG GGC GGC GGC CAC TAC CTC GTG GAG 528
161 IHMALRLEGGGHYLVE 176

529 TTC AAG AGC ATC TAC ATG GCC AAG AAG CCC GTG CAG CTC CCC GGC TAC 576
177 FKSIYMAKKPVQLPGY 192

577 TAC TAC GTG GAC AGC AAG CTC GAC ATC ACC AGC CAC AAC GAG GAC TAC 624
193 YYVDSKLDITSHNEDY 208

TCC(S) for 6/9-2G and 6/9-Q3
625 ACC ATC GTG GAG CAG TAC GAG CGG ACC GAG GGC CGG CAC CAC CTC TTC 672
209 TIVEQYERTEGRHHLF 224

673 CTC TGA 678
225 L * 226

(SEQ ID NO: 43 & 44)



